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ABSTRACT 

A GPSS model of the CP/CMS tune-sharing oonputer system at the 
Naval Postgraduate School was constructed, and was used in three 
experiments to investigate the performance of the system under a 
variety of conditions. In each of the experiments the model generated 
auto-correlated sequences of observations which were analyzed using 
techniques adapted from spectral analysis. 

An experiment to determine the effect of an increased number of 
terminals on average response time revealed that the system adequately 
could support a 25% increase in the number of terminals, and that the 
number of terminals was limited by the magnetic disk I/O capability. 

In a second experiment it was found that increasing the number of disks 
from four to eight would enable the system to support up to 20 terminals . 
A final experiment involving the examination of a new scheduling algo- 
rithm showed no significant changes in average response times. 
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I. INTRODUCTION 



The computer center of the Naval Postgraduate School operates 
a time-sharing system which currently supports 12 communication 
terminals. Anticipating a growth in demand for time-sharing services, 
there was an interest in deterrnining how many additional terminals 
could be supported without seriously degrading the performance of the 
system. Further, it is conceivable that the system response may be 
improved by modifying the important software algorithm used for job 
scheduling . 

A. THE PROBLEM 

1. Statement of the Problem 

This study was undertaken (1) to estimate the amount of system 
degradation which would accrue as a result of the additional load; (2) 
to examine alternatives to the present assignment of system resources; 
and (3) to test the effects of modification to the main scheduling 
algorithm. 

2. Importance of the Study 

The decision to apply an increased load to an established computer 
system requires seme knowledge of what effect that additional load will 
have on overall system performance. To make such a judgment purely on 
the basis of speculation or supposition is considered foolhardy at best. 
Therefore, it is intended that the results of this study will serve as an 
aid in making more informed decisions regarding proposed modifications to 
the system. 

'A. 
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3. Methods of Analysis 



Ideally, an experiment to test the effects of hardware modi- 
fications or software changes to a time-sharing computer system would 
be conducted in a laboratory atmosphere with the actual equipment on 
hand. Clearly, this would require much in the way of resources not 
the least of which would be the hardware and/or software used to 
measure system performance. In most instances, this method of analysis 
is prohibited by financial considerations alone. 

Analytical mathematical models may be considered the ideal 
analysis tool in some situations. However, the inpracticality of this 
method beccmes readily apparent in the event that the system being 
modeled is as complex as a modern time-sharing computer system. In 
order to construct an analytical model for which a solution can be 
obtained it is necessary to make a number of simplifying assumptions. 

The inaccuracies introduced into the model are difficult to estimate. 
Thus, while analytical models may serve as useful tools in studying 
seme aspects of computer systems, their usefulness in providing informa- 
tion about a specific complex time-sharing system is questionable and, 

i 

in cases such as this, computer simulation seems to be the logical 
approach. 

B. DEFINITIONS OF TERMS USED 

Definitions of frequently used terms are presented here for the 
convenience of the reader. 

1. Time-sharing system - a remote, multi-access data processing 
system which allows many users simultaneously to utilize the resources 
of the system. 

2. Core block - a set of 4096 consecutive bytes of main storage, 
the first byte of which is located at a storage address that is a 
multiple of 4096. 
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3. DASD - direct access storage device; for example, 2301 drum or 
2311 disk. 

4. FIFO - an acronym from First In First Out. 

5. Job - this term is used synonymously with "task" throughout 

this study to denote the processing required as a result of a ccmmand 
frcm a terminal. , 

6. Main storage - used synonymously with "core" to refer to storage 
directly addressable by the central processing unit. 

7. Page - 4096 consecutive bytes of program or data which may be 
located either in main storage or auxiliary storage. 

8. Paging - the transfer of pages of information between main 
storage and on-line auxiliary storage to enable a number of concur- 
rently active programs to share a limited amount of main storage. 

9. Reaction time - the time interval between the terminal's typing 
the first letter of a reply and the user's typing the last letter of 
a new catmand. This includes the time required to type the reply and 
the catimand as well as any additional time the user spends thinking. 

10. Response time - the time interval between the user's typing 

the last letter of a command and the terminal's typing the first letter 
of the reply [Ref. 10] . 

11. Virtual computer - a conceptual computer which is made to appear 
real by the control program. A distinct virtual computer is associated 
with each remote terminal. 

12. Virtual storage - the main storage of a virtual computer. Virtual 
storage does not necessarily reside in real main storage. 

13. Virtual address - an address which references virtual storage 
and must be translated into a real storage address before it is used. 

14. Dynamic address translation - the process of converting virtual 
addresses into actual main storage addresses during instruction execu- 
tion. 



C. ORGANIZATION OF REMAINDER OF TEE PAPER 

Section II presents a detailed description of the computer system 
under consideration. The model of the system is described in Section 
III which also contains discussion relating to validation and verifica- 
tion. Experimental and statistical considerations, with emphasis on 
the method of statistical analysis employed in this study, are presented 
in Section IV. Section V contains a summary of the results of the actual 
experiments which were performed on the model. Conclusions appear in 
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Section VI. Following the appendices, which contain statistical 
data, sample calculations, and flowcharts of the model, appear sample 
program output and program listings. 
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II. THE SYSTEM 



The time-sharing system discussed in this paper consists of an 
IBM System/360 Model 67-2 data processing system and its associated 
software. The software, known as CP/CMS, was developed at the IBM 
Cambridge Scientific Center and consists of two independent components: 

(1) the Control Program (CP-67) which manages the resources of the 
System/360 so that remote users appear to have a dedicated machine at 
their disposal; and (2) the Cambridge Monitor System (CMS), a con- 
versational operating system which enables users to interact with 
their programs from their terminals using relatively simple commands. 

A. HARDWARE 

The hardware configuration of the Model 67-2 system under consid- 
eration is diagrammed in Figure 1. A detailed description of the 
Model 67 may be found in Ref. 5. The card reader, punch, and printer 
are not shown or discussed in detail since all unit record I/O is 
spooled on disks by CP and not queued for physical I/O until requested 
by the user. The number of requests for physical I/O is very small and 
therefore their effect on system performance is assumed to be negligible. 

1. Processor 

The Central Processing Unit, CPU, is an IBM Model 2067-2 which 
contains facilities for (1) instruction decoding and execution; (2) per- 
forming arithmetic and logical operations; and (3) addressing up to 
eight 2365-12 Processor Storages. In order to function more efficiently 
in a paged time-sharing environment the 2067-2 is equipped with hard- 
ware implemented dynamic address translation. If dynamic address 



15 



1052-7 
CONSOLE 



2067 - 2 

CENTRAL 

PROCESSOR 



(CPU) 



2365 -12 

PROCESSOR 

STORAGE 



(CORE) 



2846-1 



CHANNEL CONTROLLER 



(CHAN2) 



\ 



2860-2 

SELECTOR 

CHANNEL 



I 



2870 

MULTIPLEXOR 

CHANNEL 



(CHAN A ) ' 



2820 




2841 




2702 


STORAGE 




STORAGE 




TRANSMISSION 


CONTROL 




CONTROL 




CONTROL 



2301 

DRUM 

STORAGE 




(DISKI) (DISK2) (DISKS) (DISK4) 



FIGURE I 

CP/CMS SYSTEM CONFIGURATION 



16 



translation fails because the referenced virtual page is not core 
resident a special interrupt, known as a page- translation exception, 
is recognized. 

2. Main Storage 

The main storage for the system consists of one IBM 2365 
Model 12 Processor Storage Unit which is directly addressable by the 
CPU and by the 2846 Channel Controller. The 2365 has a basic 750 nano- 
second storage cycle and accesses eight bytes (64 bits) in parallel. 

Each 2365 contains two independently accessible storage arrays of 128K 
bytes each. Since these two arrays can be accessed simultaneously an 
average cycle time of 375 nanoseconds is theoretically possible. The 
2365 has a total capacity of 256K bytes or 64 pages. 

3. Channels 

a. Channel Controller 

The IBM 2846 Channel Controller provides the ccranunication 
interface between the CPU and the channels, and also provides the data 
paths for control information and data transfers between main storage 
and the channels. It is the IBM 2870 Multiplexor Channel and the two 
IBM 2860 Selector Channels which actually execute the channel command 
word packages from main storage and, thus, control all I/O operations 
for their attached devices. In addition to their control functions 
the channels also provide the data path between their attached I/O units 
and the channel controller. 

b. Selector Channel 1 

Selector Channel 1 is dedicated to the IBM 2301 Drum Storage 
Unit. The 2301 is composed of 200 addressable tracks on which data is 
recorded in parallel groups of four bits. Each addressable track has 
four fixed-position read/write heads so that an entire track of 20,483 
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bytes can be read in one revolution of the drum. The drum makes a 
complete revolution each 17.5 milliseconds, giving a data transfer 
rate of 1,200K bytes per second. The total capacity of the drum is 
4.09 million bytes, or approximately 1,000 pages, 
c. Selector Channel 2 

Linked to this selector channel are four IBM 2311 Disk 
Storage Drives. Each disk contains 200 cylinders with ten tracks 
per cylinder. The access mechanism is constructed so that the ten 
read/write heads can access the ten tracks in any one cylinder with- 
out repositioning. Data is recorded serially by bits along a track. 
Therefore, it requires a full rotation of the disk (25 milliseconds) 
to read one 3,625 byte track. The total storage capacity of a 2311 
disk is 7.25 million bytes, or about 1,770 pages. Timing character- 
istics, shown in Table I, were obtained from Ref. 6 which contains a 
more detailed description of both devices. 

It should be noted that the timing figures given in 
Table I do not reflect the time which a read or write request must 
wait for a device or channel to become free. For instance, a request 
to read or write a record on one of the 2311 disks must wait for both 
the channel and the access mechanism, or arm, to become free. The 
channel then may initiate a seek for that arm. The seek time is the 
time required for the arm to be positioned at the appropriate cylinder. 
During this seek time the channel is free to service other requests or 
transmit data fron the other three disks. After the seek is completed , 
there may be another delay until the channel again is unoccupied. Once 
the channel is free, there is a rotational delay to position the hone 
address (record zero) of the track under the read/write head and an 
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TABLE I 



AUXILIARY STORAGE TIMING OIARACTERISTICS 



UNIT AND FUNCTION 


MEAN 


SPREAD 


2301 Drum 


Rotational delay 


8.75 


msec. 


0 - 17.5 msec 


Transmission time/page 


3.0 


msec. 


- 


Total 


11.75 


msec. 


3 - 20.5 msec 


2311 Disk 


Seek time 


74.25 


msec. 


25 - 135 msec. 


Rotational delay: 


to heme address 


12.5 


msec. 


0-25 msec. 


to record 


12.5 


msec 


0-25 msec. 


Transmission time/page 


26.0 


msec. 


- 


Total 


125.25 


msec. 


25 - 216 msec. 
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additional rotational delay to position the desired record under the 
head. The final delay is the time actually required to read or write 
the record, known as transmission time. The lengths of the waiting 
times mentioned are functions of the rate of reference to the disks 
and have been the subject of both simulation studies [Ref. 10] and 
queueing analyses [Refs. 1 and 10] . 

d. Multiplexor Channel 

The IBM 2870 Multiplexor Channel supports the low data 
rate I/O devices of the system. It concurrently can sustain I/O 
operations on several I/O devices by interleaving bytes of data 
associated with different devices and routing them to or from the 
proper locations in main storage. The I/O devices of most interest 
in this study are the 12 IBM 2741 Communication Terminals which inter- 
face with the multiplexor channel through an IBM 2702 Transmission 
Control Unit. These terminals are IBM Selectric typewriters with 
added electronic controls for camnunication with the computer system. 

In the time-sharing environment the terminals become the operator's 
consoles for the virtual machines, and from than the users can control 
the execution of their programs. 

4. DASD Allocation 

In the present system configuration the drum is used entirely 
for paging and can accommodate virtual storages for 13 to 15 users 
depending upon blocking factors and the number of system pages which 
are made drum resident. Disk number one is the system disk and contains 
about 500 pages of system programs including compilers, assemblers and 
macro-libraries. Disk two is reserved for paging of any virtual storages 
which cannot fit on the drum. Disks three and four are used to store 
user files, to spool I/O, and to store scratch files. 
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B. SOFTWARE 



The two major components of the CP/CMS time-sharing software system 
are the Control Program, CP-67, and the monitor systan, CMS. CP creates 
the time-sharing environment enabling many users to simultaneously per- 
form work, while CMS produces the conversational atmosphere which en- 
ables users to interact with their programs. Each component is capable, 
of operating independently on an appropriate System/360 configuration. 
Thus, CP could be used without CMS to construct a non-conversational 
time-sharing system; and, similarly, CMS could be used without CP to 
produce a conversational operating system without time-sharing capa- 
bilities. When the two are used together, in order to take full 
advantage of the features of both, CMS operates under the supervision 
of CP. Since CP is responsible for creating and managing the time- 
sharing environment its structure and operation are considered in more, 
detail. A complete description of CP-67 and CMS is available in Ref- 9. 

CP constructs and maintains virtual computers which are indistin- 
guishable from real computers to both the user and his programs. When- 
ever a user activates a terminal, CP creates for his personal use a 
virtual computer system of predefined configuration. Since the systems' 
are virtual their configurations need not correspond to either the real, 
system or each other. In the system studied, the normal virtual con- 
figuration included a Model 65 CPU, 256K bytes of main storage, three 
disk drives, and a console, the user's 2741 terminal. To each user 
his virtual system appears real and he uses it as if it were real. 
Actually, the resources of the real computer system must be shared 
among the virtual computer systems of all concurrently active users. 
Thus, CP must schedule and allocate real resources to the various 
virtual computers. 
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1 . Execution Control 



One resource which must be shared is real CPU time. The main 
dispatcher and control routine, DISPATCH, of CP is responsible for 
allocating the execution time in the CPU among the virtual computers. 
After the processing of each interrupt, control is passed to DISPATCH 
which charges time used within the control program to the appropriate 
user and performs other "housekeeping" functions. DISPATCH then 
determines if the interrupted user has run to the end of his time slice 
and, if he has, lowers his priority one level before starting the high- 
est priority runnable user. If the user has not completed his allotted 
time slice and there are no higher priority runnable users, he then is 
restarted with the remaining portion of his initial time slice. How- 
ever, if a higher priority runnable user is found, that user's time 
slice is computed and he is started. 

The time slice assigned each user task is determined, before 
it is started, on the basis of priority as indicated in Table II. 

TABLE II 

TIME SLICE AS A FUNCTION OF PRIORITY 



PRIORITY 
10, 9, 8 
7, 6, 5 
4, 3 
2 , 1 



TIME SLICE 
50 msec. 
100 msec. 
150 msec. 
200 msec. 



Users are placed at the highest of the ten priority levels whenever 
they complete a terminal interaction. When a user is started, his 
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time slice, or the remaining portion, is loaded in the interval timer, 

so that a timer interrupt signals the completion of a time slice. The 
purpose of lowering the priority of a user who completes a time slice 
is to prevent "compute bound" users from monopolizing the CPU. However, 
to ensure that such users do not fail to run at all, each 15 seconds 
DISPATCH locates all runnable users who have not run during the preceding 
15 seconds and raises than one priority level. 

2. Dynamic Adjustment of Paging Load 

In addition to execution control, DISPATCH is responsible for 
the dynamic adjustment of the system paging load. Each time DISPATCH 
is entered the following information is recorded: 

1. Time the CPU was idle with outstanding page requests 
(page-blocked idle time) . 

2. Time spent executing CP. 

3. Time spent executing problem programs. 

Every 60 seconds the ratio of page-blocked idle time plus CP time 
plus problem time to page-blocked idle time is computed . This ratio 
provides a measure of the system paging load. If this ratio is greater 
than five, indicating a low paging load, the value of PAGMUT (maximum 
number of concurrent paging operations allowed) is increased by one. 

If however, this ratio is less than twc, the paging load is undesirably 
high and, therefore, PAGMUT is decreased by one. This adjustment is 
subject to the limitation that PAGMUT must always be greater than or 
equal to two. 

3. Main Storage Management 

Another resource of the real system which must be shared among 
the virtual systems is main storage. To facilitate this process the 
256K bytes of main storage are divided into 64 equal 4K blocks, each 
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of which can hold one page of program or data. Seventeen of these 
core blocks are permanently occupied by CP-67 itself, and another 
seven are filled with the nucleus of CMS, leaving 40 blocks avail- 
able for sharing among virtual machines. These core blocks are 
shared using the technique known as paging. 

If a user program (any program other than the supervisor, 

CP) references a page which is not currently core resident, dynamic 
address translation will fail and a page-translation interrupt will 
occur. This paging interrupt results in a call to PAGTRANS, the page 
handling routine. PAGTRANS first determines if a new paging operation 
can be initiated. The value of PAGMOT is compared with PAGARP, the 
number of current paging operations. If PAGARP is greater than or 
equal to PAGMOT no new paging operations may be initiated and the page 
request is added to the queue of pending page requests, PAGMUTQ. 
(Servicing this queue is one of DISPATCH'S "housekeeping chores".) 
Control is then returned to DISPATCH. If, however, PAGARP is less 
than PAGMOT the value of PAGARP is incremented and the paging opera- 
tion is begun. 

PAGTRANS then must locate an available core block into which 
the required page may be read. This is accomplished by examining the 
core table which contains an entry for each of the 64 core blocks in- 
dicating the status of the page currently occupying that block. Blocks 
are selected for paging subject to the following constraints: 

1. No block which is locked (i.e., affected by a pending I/O 
operation) may be selected. 

2. No block which is "in transit" (i.e., affected by a pending 
paging operation) nay be selected. 

3. No block having a FIFO flag set may be selected. FIFO 
flags are set whenever a page is read into a block. When all 
FIFO flags are found to be set they are all reset immediately. 
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Blocks meeting the constraints listed above are selected in the 
following order of priority: 

1. Non-valid (i.e., currently empty) 

2. Unused and unchanged 

3. Unused and changed 

4. Used and unchanged 

5. Used and changed. 

When an available core block is located, the transit, valid and FIFO 
flags are set in the corresponding core table entry. 

If the changed flag is set in the selected core table entry, 
the page currently occupying that core block must be written into its 
direct access storage device location before the new page can be read. 

This process is known as page "swapping" . PAGTRANS determines the 
appropriate DASD address, generates an I/O task block to write the 
page, and adds this block to the queue of I/O task requests. After 
this is completed, or if it were unnecessary, PAGTRANS creates an I/O 
task block to read the requested virtual page from its DASD location 
and queues this block for execution. Control then is returned to 
DISPATCH. 

When the requested page is read into its core block the transit 
flag is cleared from the corresponding core table entry and the requesting 
user again is made runnable. 

4. I/O Management 

All virtual computer I/O operations must be translated into real 
I/O operations and scheduled by CP. Since CP/CMS is a disk file oriented 
system and because all unit record I/O is spooled on disk by CP, virtually 
all input and output may be considered to be disk I/O. 
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All I/O instructions are privileged; that is, they may be 
executed only by a supervisory program. Consequently, any attempt 
by a virtual machine to execute an I/O instruction leads to a 
privileged operation interrupt and a transfer of control to CP. CP 
translates virtual device addresses into real device addresses, virtual 
storage addresses into real storage addresses, and virtual channel com- 
mand lists into real channel conmand lists. It also locks the affected 
page(s) into core, places the requesting virtual computer in an IOWAIT 
status, queues an appropriate I/O task block for execution, and returns 
control to DISPATCH. When CP receives an interrupt indicating the 
completion of an I/O operation it unlocks the affected page(s) , makes 
the requesting user's virtual machine runnable, and returns control 
to DISPATCH. 
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III. THE MODEL 



A. MODEL DESIGN 

A flexible GPSS model was constructed within which changes in 
system configuration could easily be incorporated . To accomplish 
this required a rather large number of functions and variables, the 
use of which resulted in a corresponding increase in the execution 
time of the program. However, it is felt that the benefits which 
arise as a consequence of being more general far outweigh the dis- 
advantage of somewhat longer execution times. This is particularly 
true when an experiment is being conducted which involves the 
examination of a number of alternatives. In this case it is desir- 
able that changes be accomplished merely by modifying sane parameters. 

The model also was constructed in a modular fashion. Those 
routines which simulate DISPATCH, PACTRANS, and IQEXEC, for example, 
are essentially separate sections. Thus, to reflect changes in the 
paging algorithm it is necessary to modify or replace only that section 
of code which deals with paging. 

Depending upon the system configuration and the job mix, an execution 
ratio of between one-to-one and one-to- three was obtained. Thus, one 
minute of 360/67 time was needed to simulate one to three minutes of 
time on the system being modeled. Using results provided by Nielsen, 
and others, in similar studies it is judged that this ratio is not 
unacceptable [Ref. 11] . To assist in reducing overall execution times, 
user chains were provided in appropriate locations throughout the program. 
In addition, an experiment was performed to determine precisely how much 
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time could be saved by significantly reducing the amount of coding 
in the paging section, one of the lengthier sections in the program. 

This is discussed more fully under "level of detail", later in this 
section. 

B. GENERAL DESCRIPTION 

A GPSS program consisting of 249 blocks, 47 variables, and ten 
functions comprises the model of the time-sharing system. The program 
consists of the following sections: 

1. Control information 

2 . Function description 

3 . Variable description 

4. Timing, accumulation of statistics, output, and simulation 
of DISPATCH routines 

5 . Pre-execution 

6 . Execution 

7 . Post-execution 

8 . Miscellaneous routines 

9 . Input/Output 

10. Paging. 

The operation of each of these parts is discussed in detail later. 
Functions and variables describing the distributions of (1) execution 
time; (2) program size; and (3) user reaction time were obtained frctn 
a study by Scherr at the Massachusetts Institute of Technology [Ref. 12] . 
In obtaining these empirical distributions, Scherr used an extremely 
large sample size which lends seme measure of credence to the statistics. 
The supposition that these distributions were valid for use in this study 
was based upon the consideration that one would not expect the character- 
istic requirements of programs written at M.I.T. to be vastly different 
frem those written at this school, since both are educational environ- 
ments where the majority of programmers are students. 
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C. JOB DESCRIPTION 



Jobs resulting from terminal interactions are represented by 
"transactions", temporary GPSS entities. Jobs are completely 
described by means of their parameters, which are explained in 
Table III. 

Apart from those created for special purposes, such as interrupt 
processing and timing, the number of transactions, and hence the number 
of jobs active in the system, is equal to the number of terminals. 

That is, it was assumed that all terminals had "logged on" and were 
in use. Clearly, this situation will not always be prevalent in the 
real system; however, it was decided to err on the side of conservatism 
and portray the systen under the most demanding conditions in order to 
ensure that any conclusions about system performance were based upon 
"worst case" estimates. 

Jobs have exponentially distributed execution times. For all runs, 

40% were designated as "large" jobs with a mean execution time of 2,000 
milliseconds and 60% were considered "small" jobs having a mean execu- 
tion time of 25 milliseconds. This results in the hyper-exponential 
distribution which has been used in other studies of this type. [Ref. 13] . 
At most, two GPSS cards need be modified to alter either the job mix or 
the mean execution times. 

D. PAGE ACCOUNTING 

Two 40 word arrays were established to keep track of every page in 
real core. These arrays, with typical contents, are illustrated in 
Figure 2. Array A, halfword savevalues 1-40, contains an up-to-date 
list of all pages currently in real core, exclusive of the 24 blocks 
assigned to the core resident portions of CP/CMS. Page numbers identify 
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TABLE III 



PARAMETER 

1 

2 

3 

4 

5 

6 

7 

8-9 

10 

11 

12 



JOB PARAMETER DESCRIPTION 



DESCRIPTION 

Indicates the time remaining before the 
next scheduled request for a page. 

Indicates the time remaining before the 
next scheduled request for I/O. 

Contains the time remaining in the time 
slice currently allotted. 

Initially, contains the total CPU tame 
required for completion of the job. This 
number is decremented by an appropriate 
amount each time the job gains control of 
the CPU. 

Contains the length of a time slice which 
is determined on the basis of priority. 

Contains the remaining time the job was 
scheduled to have control of the CPU when 
interrupted. This number is used to up- 
date parameters one through four to reflect 
the current status of the job. 

Contains the terminal number. 

Pointers for arrays and logic switches. 

Points to one of the first four parameters 
and, by indirect addressing, is used to 
determine the amount of time a job will spend 
in the CPU (ADVANCE P*10) . This parameter 
also is used as an index upon completion of 
a unit of processing to select the correct 
destination for the job. 

Contains either a one or a zero and is used 
to adjust the priority of a job to its correct 
level . 

Used to record the correct priority of 
a job. 
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TABLE III (continued) 



PARAMETER 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

26 



DESCRIPTION 

A pointer to one of two variables which, 
initially, are used to select the amount 
of CPU time a job will require. 

A pointer used to designate the I/O 
channel for the job. 

A pointer to a savevalue location. 

Contains either a one or a two. Used to 
indicate the number of times a job will 
loop during the paging operation. 

Indicates the program size of a job. 

A pointer to the core status table. 

Contains the number of the page which 
currently is required by the job. 

A counter used in looping to reset the 
FIFO indicators. 

A pointer to the core status table. 

A pointer to one of two variables. Used 
to select the time to next page request. 

A pointer to one of two variables. Used 
to select the time to next I/O request. 

Contains the number of the disk being 
referenced . 

A pointer to the core status table. 

Contains either a one or a two which 
indicates in IOWAT whether the job is 
doing I/O or paging from a disk. 
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HALFWORD 

SAVEVALUE CONTENT 

LOCATION 



HALFWORD 

SAVEVALUE CONTENT 

LOCATION 
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1205 


41 


1 

10001 


2 


113 


42 


00011 
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707 


43 


10101 
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1209 


44 


00011 
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• 
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• 
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39 


1202 


79 


10011 


40 


801 


80 


10001 
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both the user and a particular page for that user. As an example, 
the entry 1205 refers to the fifth page for the user on terminal 12. 

It was assumed that the user's requirements would not exceed 99 pages; 
otherwise fullword savevalue arrays would have been required. Array 
B, halfword savevalues 41-80, contains the status of the associated 
page in array A; that is, a sequence of zeroes and ones whose location 
signifies a particular indicator and whose value represents whether 
or not the indicator has been set (O-reset, 1-set). The indicators, 
fron left to right, are: 

1. Locked. A one in this position indicates the page has been 
locked in core, which implies that this particular core location 
may not be used for paging at this time. 

2. Transit. A one signifies that the page is in transit from a 
DASD. This block of core may not be used for paging until the transit 
indicator is reset. 

3. FIFO. FIFO indicators are set when a page is read into that 
core block. Once all FIFO indicators have been set, they are reset 
immediately. No core block having the FIFO flag set may be selected 
for paging. 

4. Changed/Unchanged. This indicator is used to indicate whether 
or not a particular page has been altered during execution. If this 
indicator is set, the page must be written on its DASD before another 
page can be read into its core block. A reset indicator permits a 
read-only operation since an exact copy of the old page already resides 
on its DASD. 

5. Valid/Non-valid. A reset indicator means the core block is 
unoccupied. 



E. DETAILED MODEL DESCRIPTION. 

A detailed explanation of the individual GPSS instructions is 
contained in the GPSS User's Manual [Ref. 7]. 

1. Control Information 

This section of code is comprised of the RMULT card and the EQU 
cards. The RMULT card serves the purpose of initializing the seeds for 
the eight pseudo-random number generators, thereby minimizing the 
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possibility of spurious effects due to periodicity which otherwise 
might occur as a result of the eight generators producing the same 
sequence of numbers. 

The BQU cards enable the programmer to use symbolic names, 
in place of numbers, in the operand fields of instructions when 
referring to GPSS entities. This feature not only maxes the program 
itself more readable but also enhances the appearance of output data. 

2. Function Description 

a. NRPAG 

This function is used in conjunction with the variable SIZE 
to assign to each job a program size of 1-57 pages. The mean and median 
of this distribution are approximately eight pages and two pages respec- 
tively. 

b. REACT 

This function is used in conjunction with the variable THINK 
to assign to each job a user reaction time. This distribution has a mean 
of 35 seconds and a range of 0-5 minutes. 

c. CPOTI 

This function points to one of two variables, CPUHI or CPULO, 
which designate the amount of CPU time a job will require. 

d. PAGTI and IOTIM 

These functions are used in a manner similar to CPUTI and 
assign to jobs a length of time which is interpreted as the interval 
between successive requests for pages and I/O respectively. 

e. PGDSK 

This function assigns to each job a disk for paging and is 
unused with less than 16 terminals active. 
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f. DISK 

The function DISK selects a disk for each I/O request. 

g. SEEK 

This function describes the distribution of seek times for 

a 2311 disk. 

h. EXPON 

EXPON is the standard exponential distribution. 

i. TIMER 

This function is used with the variables TIMES and BTS to 
provide each job with a priority dependent time slice. 

3. Variable Description 

Variables are described by ccmments in the program listing. 

4. Timing, Accumulation of Statistics, Output , and Simulation of 
DISPATCH Routines 

A transaction is created every 15 seconds. After counters have 
been incremented, the 15-second check of runnable users is performed. 

Upon completion of this check, the contents of appropriate savevalue 
locations are updated. These savevalues are used to maintain information 
relating to (1) execution, overhead, and idle percentages; (2) paging 
rates; and (3) problem mode time, control program time, idle time, and 
page-blocked idle time. Once the warm-up period has elapsed, appropriate 
statistics, such as facility utilizations and queue lengths, are printed 
every 15 seconds. At the block labeled MINUT a test is made to see if 
the 60-second check should be performed. Clearly, every fourth transaction 
will cause this check to be made. Finally, the transaction is terminated 
and the termination count is decremented by one. Two features of this 
section of code should be noted. First, no blocks exist which, in any way/ 
can delay the transaction and, as a result, the relative and absolute clocks 
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are not advanced. Second, the TERMINATE block at the end is one of 
only two places in the entire program where the termination count is 
decremented; the other being at the block labeled GOODG which is not 
accessible until sufficient statistics have been gathered to end the 
run. 

5. Pre-processing 

A transaction is created for each terminal on the system. These 
transactions symbolize "jobs" to the computer system and circulate through- 
out the main sections of the program simulating processing, creating inter- 
rupts, and initiating requests for pages and I/O as their parameters 
dictate. There are two large loops within the main program. The outer 
loop commences with the block labeled DOIT and ends at one of several 
blocks in the post-execution phase. Each job traverses the outer loop 
only once. Each transaction which leaves the ADVANCE block at DOIT repre- 
sents a new job and frcm that time until its processing is completed 
circulates in the inner loop which commences at the block labeled EXEC 
and ends at a number of locations depending upon the status of the job. 

Upon departing the ADVANCE block at DOIT, transactions set a logic 
switch which is used to indicate that they are not eligible for a priority 
increase. The status of the logic switch is examined during the afore- 
mentioned 15-second check. Jobs then are assigned (1) the amount of CPU 
time required for processing; (2) the program size in pages; (3) I/O and 
paging rates; (4) times to first I/O and page requests, and (5) a basic 
time slice of 50 milliseconds. 

After pointers to logic switches have been initialized, the job 
is assigned a channel for I/O and disk paging. An interrupt then is 
created as a result of the terminal interaction and the priority of the 
job is set equal to the datum, or reference level, of 100. It should be 
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noted that all transactions which simulate interrupts are created with 
a priority of 125 to ensure their prompt processing. (Access to the CPU 
is made on the basis of priority.) The BUFFER block restarts the Over- 
all GPSS/360 Scan in order to handle the interrupts which, otherwise, 
would not be processed until the status change flag was set to ON. The 
operation of the Overall GPSS/360 Scan is described in the GPSS User’s 
Manual [Ref . 7 ] . A MARK block then is encountered which records the 
time the job was created. The combination of this MARK block and a 
succeeding TABULATE block with Ml in operand field A serves to compute 
the transit time for the job. 

6. Processing 

The execution phase, as well as the inner loop of the main program, 
commences with the block labeled EXEC. The next higher priority level is 
recorded and will be used in the event the job qualifies for a priority 
increase during the next 15-second check. One of the conditions for a 
priority increase then is established by setting a logic switch to indicate 
that the job is "runnable". This switch is reset immediately when a job 
obtains control of the CPU. 

Jobs compete for and gain control of the CPU on the basis of 
priority. Once the CPU is obtained, a test is made to determine whether 
or not the current job was previously interrupted. If so, the job's 
priority is lowered by one to offset the temporary priority increase 
received at the time of interruption. That temporary priority increase 
ensures that the interrupted job will be given preference over all other 
jobs in its priority class when it returns to the queue for the CPU. 

A check then is made to decide whether or not the job's desired 
page is in real core. If so, processing is allowed to continue; if not, 
the job relinquishes control of the processor and transfers to the paging 
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section to initiate a request for the page. Next, an arbitrary per- 
centage of jobs indicate that their current page will be changed during 
execution which necessitates swapping once a request is made to bring 
another page into that particular block of real core. This percentage 
was maintained at 75% throughout all runs. 

A priority dependent time slice then is computed and assigned 
to the job with the provision that this job is not the last interrupted 
user. If so, the job continues to process with the remaining portion 
of the time slice it had then interrupted. 

The contents of parameters one through four are retained in save- 
values so that they may be adjusted after a unit of processing has been 
completed. These parameters contain, in order, (1) time to next page 
request; (2) time to next I/O request; (3) time remaining in the current 
time slice; and (4) overall CPU time remaining before the job is completed. 
The minimum of these four times is selected and the job enters the ADVANCE 
block with this time. If the job is interrupted prior to taking a normal 
exit from the ADVANCE block it is sent to the block labeled HOLD, which 
will be described later. If, however, the job is not interrupted, problem 
mode time and parameters one through four are adjusted to reflect the 
amount of time the job had control of the CPU. Control of the processor 
then is relinquished. 

7 . Post-processing 

This section of code begins with the block labeled NEXTB. In order 
for a transaction to arrive at this location at least one of its first four 
parameters must be equal to zero. The number of this zero-valued parameter 
is contained in parameter ten. The first TRANSFER block sends the trans- 
action to one of the next four blocks depending upon the value of parameter 
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ten. Three of these four blocks are unconditional transfers to various 
routines which handle such matters as paging and I/O. If the trans- 
action is sent to the fourth block this signals completion of its 
required processing and an interrupt is created following the priority 
increase. If the stabilization period is not over, jobs are sent to 
DOIT and begin the cycle over again; otherwise transit time is tabulated 
in the TOTAL table before the transfer to DOIT. 

Average transit time is calculated and printed for every five 
jobs. Finally, if sufficient statistics have not been gathered the 
transaction is sent back to DOIT. If the specified sample size has been 
reached, the TOTAL table is printed and all remaining transactions are 
terminated inmediately by means of the blocks labeled COMPL and GOODO, 
thus completing the run. 

8. Miscellaneous Routines 

Four miscellaneous routines will be discussed. 

a. DEMOT 

This routine, commencing with the block labeled DEMOT, is 
entered from the post-execution phase when parameter three equals zero 
indicating the job has completed a time slice. The priority of the trans- 
action inmediately is lowered one level, a transaction is created which 
simulates a timer interrupt, and the job is sent back to the queue for 
the CPU. 

b. HOLD 

This routine commences with the block labeled HOLD and is 
entered by transactions which have been interrupted during execution. 

The source from which the transaction came is determined by a TEST block, 
the test being made on the basis of priority. If the transaction came 
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frcm the IDLER routine, page-blocked, idle time is adjusted accordingly 
and the job is sent back to IDLER. Otherwise, problem mode time is 
adjusted together with parameters one through four-, the transaction's 
priority is temporarily raised, a flag is set to indicate this job as 
the last interrupted user and the job is sent back to the queue for the 
CPU. 

c. INT 

This routine simulates the processing of interrupts created 
by a SPLIT block. Interrupts are handled on a FIFO basis. After control 
of the CPU has been obtained the transaction advances the length of time 
required to process the interrupt, adjusts control program time, relin- 
quishes the CPU, and terminates without decrementing the termination 
count. 

d. IDLER 

A single transaction with a priority of one is generated 
at the start of the run. This transaction initializes the value of 
PGMUT to ten. PGMUT, the maximum number of pages allowed in transit, 
is adjusted dynamically during program execution by means of the 60 - 
second check. This transaction then cycles back and forth between the 
HOLD routine and the IDLER routine. The only function of the IDLER 
section is to maintain a record of page-blocked idle time; that is, the 
time that the CPU is idle during which at least one page request is out- 
standing and remains to be honored. This is accomplished by means of the 
TEST GE, GATE NU, and TRANSFER blocks which require that two conditions 
be satisfied simultaneously before this transaction with its low priority 
is allowed to gain control of the CPU. Once these conditions are met, the 
transaction obtains the CPU and attempts to advance for essentially an 
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unlimited length of time. However, the amount of activity in the model 
prohibits normal exit from the ADVANCE block. When this transaction is 
interrupted it will be sent to HOLD where page-blocked idle time will 
be adjusted. This cycle is repeated until the run is completed. 

9. Input/Output 

This section of code, commencing with the block labeled IOWAT, 
is entered frcm the post-execution phase when parameter two equals zero 
indicating a request for I/O. A flag is set to denote an I/O request. 

This is necessary because some of the code in this section also is used 
by jobs which are paging from a disk. After creating an interrupt, a 
new time is obtained to the next I/O request. The appropriate page is 
locked in core, to be unlocked only upon completion of I/O. Then, a disk 
is assigned and the time required for the I/O to be accomplished is placed' 
in parameter 13. Jobs then queue for the disk, or arm, and wait for the 
channel to become free. Once the channel is obtained a seek is initiated. 
Upon completion of the seek the job re-enters the queue for the channel. 
When control of the channel is obtained once more the I/O request is 
honored. The value of parameter 26 is tested to determine the source of 
the transaction. If the transaction was paging frcm a disk it is returned 
to the block labeled TEST in the paging section. If the transaction was 
doing I/O the page is unlocked and, after an interrupt is created, the 
job is sent back to queue for the processor. 

10. Paging 

This section of code commences with the block labeled PAGEW and 
is entered from either the execution or the post-execution phase. Due to 
the many options which exist in this section of code only one case will be 
described in detail. An atterrpt to explain all the possible situations 
that may arise would result, it is felt, in a mass of confusion. The case 
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which will be discussed is typical; a job requests a page which is 
not in real core, a block of real core is available for paging but 
the old page has been changed and must be swapped, and the requested 
page resides on the CMS disk. Alternatives to the case described can 
be understood by examining the source code in combination with the 
flowcharts . 

Initially, the job receives a new time to the next page request 
and sets a flag indicating that it is paging, as opposed to performing 
I/O. After an interrupt is created, the number of the desired page is 
calculated and recorded to facilitate the re-checking which is required 
in the execution phase. Next, the core status table is scanned by means 
of the SELECTE block to determine if the requested page already is in 
core. Under the assumption that it is not in core the exit to FNDPG is 
taken. 

At FNDPG a check is made to ensure that the maximum number of 
pages are not already in transit. If not, this number is incremented 
and the job examines the core status table by means of the SELECTMIN 
block to locate the most desirable core location for paging. Then, the 
status of that core block is settled by means of the two TEST blocks. 
Assuming that no problems exist the exit to OKGO is taken, at which point 
the test is made which indicates that the old page had been changed. 
Parameter 16 is set equal to two implying that two steps are required. 
First, the old page must be written onto the drum or disk, the determina- 
tion of which is made in the next TEST block. Second, the requested page 
must be read into core. Assuming that the old page was obtained from the 
drum the exit to NORM is taken. The job then queues for the drum sel- 
ector channel, writes the old page onto the drum once the channel is 
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seized, releases the channel, and transfers unconditionally to the 
block labeled TEST. The time spent writing the page onto the drum 
is uniform over the interval 3-21 milliseconds, a time interval which 
is characteristic of the 2301 drum. (See Table I) 

At TEST an interrupt is created and the value of parameter 16 
is shown to be two by the TEST E block. This number is decremented 
indicating that half the cycle of swapping has been completed. 

The TRANSFER block arbitrarily designates 10% of the requests 
as CMS page requests. This percentage also was maintained constant 
throughout all runs. The number of the CMS disk is assigned to parameter 
24 and an unconditional transfer is made to DISK+1. Disk paging time is 
calculated and the transaction then is sent to DSKIO in the I/O section 
for processing. Once the desired page has been read into core, the 
transaction returns to the block labeled TEST, at which time another 
interrupt is created. The value of parameter 16 is now one and, hence, 
the transaction is sent to the block labeled DONE. 

At DONE the number of pages in transit is decremented by one, 
the scan is restarted, the transit indicator is reset and the job returns 
to EXEC to queue for the CPU. 

Several special cases should be noted. 

a. Core Block Unavailable for Paging 

If the scan of the core status table indicates that all pages 
in core have been locked or designated as "in transit" the job is sent to 
a FIFO user chain, PROB, to await removal of one of these conditions. 

b. Page Already in Core 

If the requested page already is core resident then only that 
section of code between PAGEW and FNDPG will be executed. 
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c. All FIFO Indicators Set 



If all FIFO indicators have been set the transaction, 
before continuing to be processed, resets all the indicators. This 
is accomplished by means of the three blocks of code starting with 
REFBT. 

d. Maximum Number of Pages in Transit 

If the desired page is not core resident and the number 
of pages in transit is already at the maximum permitted the job will 
be sent to another FIFO user chain, PGMTQ, to wait until some job has 
finished paging and decrements the number of pages in transit. 

F. USING THE MODEL 

An example of the output obtained from the GPSS program each 15 
seconds of simulated time is included with the computer output. The 
first statistics presented are those concerned with CPU utilization 
and paging activity, and are represented as contents of fullword save- 
values . 

The first three savevalues are titled appropriately and give the 
percentage overhead, percentage execution, and percentage idle for the 
CPU. These figures are truncated to tenths of one percent. PAGIN and 
PGOUT give the pages per second transferred into and out of core, re- 
spectively. Next, the standard GPSS facility and queue statistics are 
presented for the I/O channel (CHANA) , the CMS disk (DISKI) , the paging 
disk (DISK2) , the I/O disks (DISK3 and DISK4) , the processor (CPU) , and 
the drum and its channel (CHAN2) . Finally, the average response times, 
expressed in milliseconds, for two groups of five jobs are printed as 
the two values of savevalue location 322. 
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The next two pages of computer output contain exanples of standard 
GPSS table output. During preliminary runs much use was made of data 
tabulated in this form. The first table presented, PGS, contains an 
observed distribution of program size in pages. The second table pre- 
sents an observed distribution of 2311 disk seek times. 

For a detailed explanation of the standard GPSS output see Ref. 7. 

G. LEVEL OF DETAIL 

In general, a model is not intended to reflect the most minute 
detail of the real system it purports to represent? time considerations 
alone seldom permit this. Consequently, delays in response due to dynamic 
address translation, for example, were not included in the model since, 
as Nielsen points out, "These functions are an order of magnitude finer 
in detail than the activities at the paging level." [Ref. 11]. 

It is essential, therefore, to extract from the real system those 
factors and relationships which have a significant effect upon the per- 
formance measure being examined and to disregard all unimportant details; 
keeping in mind, however, that what is deemed unimportant in one study 
may have vast implications in another. 

Since the system being modeled is page oriented and since the operations 
of paging and page-swapping are vital factors in determining systsn response 
time, some provision was needed to keep track of every page in real core at 
all times. An explanation of the technique used is contained in a previous 
discussion on page accounting. 

Because of the relatively large number of paging and I/O operations 
that take place over a small interval of time it was decided to use access 
times based on the probability distributions of seek time and rotational 
delay for the device in question rather than attempt to maintain a record 
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of the actual location of each page on that device. As they exist 
now, the paging and I/O sections of the model are quite lengthy. To 
complicate these sections further would only result in a poorer 
execution ratio without, it is felt, providing any additional informa- 
tion on system performance. The distribution of disk seek times was 
obtained from Ref. 1 and assumes that all cylinders have an equal 
probability of being referenced. 

An experiment was conducted to measure the amount of execution 
time that could be saved by replacing the paging section with a reduced 
amount of coding which, hopefully, would create the same effect on over- 
all system performance. To accomplish this, a pilot run was made to 
gather statistics on (1) the percentage of jobs whose desired page was 
already in core at the time a page request was initiated; and (2) the 
amount of time jobs spent paging when the desired page was not in core. 

By means of a TRANSFER block using the fractional mode, jobs, upon 
entering the paging section, were either sent back to the queue for the 
CPU (page in core) or allowed to page (page not in core) . If paging 
was required the job entered an ADVANCE block whose field A argument 
was a function, the distribution of which was obtained in the pilot run. 
Upon departure from the ADVANCE block, jobs were sent back to the queue 
for the CPU. Thus, all that code relating to the maintenance of the core 
status tables was eliminated. 

The effect of this seemingly major change was unexpected. Although 
the performance of the system was unaltered measurably, only one minute 
of execution was saved frcm a total of 25. Now, the saving of one minute 
of 360/67 time is not to be taken lightly and if there had been sufficient 
evidence to indicate that the statistics obtained from the pilot run were 
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applicable under all conditions this change would have been incorporated 
in the model. In order to acquire such evidence, however, additional 
pilot runs would have been necessary, resulting in an increase in the 
total computer time required to complete this study. Therefore, it was 
decided to return the code which had been removed and proceed with the 
study as originally intended. No further attempt was made to condense 
any portion of the program. 

By examining block counts it was concluded that a large share of 
the execution time was spent either processing simulated interrupts or 
accumulating statistics. Unfortunately, interrupts occur individually 
and at rather unpredictable, although deterministic, times. Thus, each 
interrupt is a separate entity and, as such, must be processed as an 
individual task. 

The standard set of GPSS statistics which is maintained automatically 
by the system can be obtained easily; so easily, in fact, that it is often 
more difficult to suppress this information than to acquire it. However, 
rather devious means must be employed to acquire statistics beyond those 
normally produced, especially if the statistic does not lend itself to 
tabulation in a table. The use of these unwieldy procedures consumes 
much execution time. 

H. VALIDATION 

Validation, is the process of illustrating, by means of various sta- 
tistical tests, that the behavior of the model conforms to the behavior 
of the real system under an equivalent load. That is, with a given system 
configuration and an identical input, or job mix, one tries to show that 
the variates which reflect some measure of performance in the model are 
drawn from the same distribution as those which reflect identical measures 
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of performance in the real system Statistical tests such as 

goodness of fit may be employed . 

The claim that a model is valid remains only conjecture until 
such time as statistical evidence is available to support that claim. 

The fact that the model, under a variety of tests, acts reasonably 
well can only be interpreted as encouraging. That is, at least the 
model did not yield any results which are known to be impossible to 
achieve in the real system. Whether or not the results are truly 
representative is another matter. 

The dilemma that one faces, of course, is determining how much 
statistical evidence is required. Some may argue, incorrectly it is 
felt, that no amount of information will suffice to show validity. 

However, enough statistical evidence must be obtained to show that 
the model is adequate for the purposes of the study; adequate in the 
sense that those aspects of system behavior which are relevant to the 
study are reflected by the behavior of the model. 

Unfortunately, it was not possible to even begin to validate the 
model used in this study, although an attempt was made to secure system 
performance figures from four different sources. Only one of these 
sources responded and that one was unable to provide any information 
which could be used for validation. Thus, it cannot be said with absolute 
certainty that the model really simulates CP/CMS, in spite of the fact 
that much care was taken to represent the system faithfully. 

The importance of validation cannot be overemphasized . It is, per- 
haps, the single most important phase of model design and irrplementation. 
However, the lack of validation does not preclude using the model to make 
inferences about the system. To make such inferences one is forced to 
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resort to the use of sensitivity analysis as a tool; and, although a 
cogent argument can be made in its behalf, any results obtained by 
this method must be viewed with a measure of skepticism. Nevertheless, 
it was precisely this type of analysis that was employed in this study.. 

I. VERIFICATION 

Before any simulation experiments are made, it is necessary to verify 
the model; in other words, to ensure that the model of the system logically 
functions as expected. 

During preliminary program runs much use was made of the TRACE and 
SNAP options of GPSS. A close examination of the information provided by 
these features gave every indication that the model was behaving logically 
as intended. That is, interrupts were occurring at the proper point in 
time and were being processed as they occurred, the status of pages in 
core was being maintained properly, the algorithm for selecting CPU 
advance time was functioning correctly and, in general, all block counts 
appeared very reasonable. As expected, for example, no transactions were 
placed on the problem chain, PROB. Further, jobs in execution were trans- 
ferred to the correct location either upon causing an interrupt or upon 
being interrupted. 

Job execution times were tabulated and graphed and were hyper- 
exponentially distributed as intended, with mean execution times in the 
irrmediate neighborhood of the theoretical value. 

The possibility of performing an analysis of the queueing behavior 
of the model based upon theoretical concepts was considered and rejected 
since a Poisson arrival rate cannot be assumed when arrivals are dependent, 
upon previous results in the system [Ref. 10] . 
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Chi-square tests for goodness of fit were applied to distributions 
obtained from the functions used by the program., In all cases, the 
hypothesis was accepted at the .05 level of significance that the 
empirical distribution fit the theoretical distribution. The results 
of the tests are tabulated in Table IV. The actual data are contained 
in Appendix A. 



TABLE IV 



RESULTS OF CHI-SQUARE TESTS 
FOR 

GOODNESS OF FIT 



FUNCTION 



DEGREES 

OF 

FREEDOM 



NUMBER 

OF 

OBSERVATIONS 



CRITICAL 

VALUE 



COMPUTED 

VALUE 



EXPON 20 



412 



31.4 



17.8 



NRPAG 12 



380 



21.0 



6.74 



REACT 19 



374 



30.1 



26.5 



SEEK 



8511 



7.81 



1.25 
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IV. EXPERIMENTAL AND STATISTICAL CONSIDERATIONS 



Once the model had been constructed it remained (1) to design the 
experiment so that a sufficient quantity of information was acquired 
in a form suitable for analysis; and (2) to statistically examine that 
information in order to make, or to substantiate, any conclusions about 
the system. A review of the literature indicated that several methods 
are available for analyzing data obtained in a simulation experiment. 

The purpose of this section is to explain and justify the method used 
herein. 

A. EXPERIMENTAL CONSIDERATIONS 

Response time was selected as the primary measure of system perform- 
ance. It was not, therefore, the intent of this study to recorrcnend 
procedures for better utilizing system resources, such as core or direct 
access storage devices, except as those procedures related directly to 
improving system response time, the user's principal concern. Within 
the framework of GPSS it was a simple matter to represent response time 
as an attribute of transactions. 

Conway [Ref. 2] suggests three alternatives for measuring the attri- 
butes of temporary entities. The first method consists of fixing the 
duration of the simulation run and including in the sample the pertinent 
attribute for all transactions which terminate in that fixed interval. 

The second method is characterized by fixing both the starting time and 
the sample size and running the simulation until the desired number of 
observations is obtained, including in the sample the attributes of those 
transactions which were being processed by the system when the starting 
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time was achieved. The third method also fixes the starting time and 
the sample size but only considers the attributes for a continuous set 
of transactions which commenced processing after the starting tine was 
achieved. 

Method two was adopted; that is, the sample was composed of the 
attributes of the first thousand transactions to terminate after a 
specified time. Obviously, these transactions were not necessarily 
the first thousand to be created. Thus, the set of observations which 
comprised the sample was not determined precisely until the run actually 
ended. It is seen that this procedure may tend to bias the average 
transit time toward a more favorable result particularly if, in the 
model, it is characteristic that jobs requiring a large amount of CPU 
time never terminate during the course of the run. In the extreme, 
consider the case with 12 user terminals in which the first 11 jobs 
created require the maximum amount of processing time and, in addition, 
have high paging and I/O requirements. Thus, these 11 jobs will remain 
active in the system for a considerable period of time. It may be that 
the 12th job requires the minimum amount of CPU time and has virtually 
no I/O or paging requirements. It is conceivable that this job could 
terminate almost immediately and be replaced in the system by another 
job for that terminal with the same minimum requirements. Thus, the 
second job also may terminate quickly and it is possible that this cycle 
may continue until the specified number of observations is obtained with 
the result that the sample is restricted to only those jobs which place 
a light demand on the system. The researcher, having no reason to believe 
otherwise, may conclude, quite inaccurately, that the mean transit time is 
exceptionally good for all types of jobs. However, the use, in pilot runs, 
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of the TRACE and SNAP options enabled the authors to follow the active 
history of transactions which imposed a variety of demands upon the 
system, both heavy and light and to conclude that the situation 
described above does not occur in the model used to study CP/CMS. 

Alternative one was not selected because a minimum sample size, 
although desired, could not be guaranteed using this strategy. Simi- 
larly, alternative three was rejected since it was not felt necessary 
to eliminate from the sample the relatively small number of transactions 
which were probably created in the waning seconds of the stabilization 
period. In addition, the primary concern was to obtain an estimate for 
the mean transit time and, therefore, the manner in which individual 
observations were grouped was of no consequence. Although Conway in- 
dicates that the third strategy should assist in reducing the variability 
of the results, it is debatable whether or not these hypothetical reduc- 
tions would have been so beneficial in tightening confidence intervals as 
to warrant the expense of inserting additional code to keep track of sets 
of transactions throughout their active history. 

The matter of model equilibrium was raised; that is, the measure- 
ment of attributes should not begin until the simulation model has 
reached sane sort of steady-state, or stable condition. Clearly, the 
length of a suitable stabilization period in simulated time is a function 
of the amount of activity that occurs within the model during that sim- 
ulated time. If the basic unit of time is milliseconds and if events 
are scheduled to occur within the model every few units of time, both 
of which conditions are satisfied in this model (jobs averaged about 6-8 
milliseconds in the CPU during a unit of processing) , then one certainly 
would expect the stabilization period to be expressed in terms of seconds 
or minutes. 
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Furthermore , for the model used in this study, it was not at all 
apparent that a stabilization period was even required since the 
system itself undergoes a transient period of relative inactivity every 
time operation is commenced. Unlike job shop models, or the like, in 
which jobs remain in the system after closing hours and are present 
when business re-convenes the following day, when it becomes necessary 
to terminate time-sharing services jobs in execution are effectively lost 
and no queues for system resources remain overnight. Thus, each day the 
system starts afresh in a somewhat empty and idle condition. However, 
since it would not be reasonable to judge the merits of a time-sharing 
system on the basis of the first few minutes of operation, it was decided 
to allow for a period of "warm-up". 

Pilot runs were made in which one, two, three, four, and five minute 
stabilization periods were used. An examination of facility utilizations, 
queue lengths, and other statistics from these runs revealed that, in all 
cases, the model had stabilized by the end of the "warm-up" period. There- 
fore, although one minute would have been sufficient, a three minute period 
was chosen in order to be conservative. 

Similarly, the choice of a suitable time interval for measuring the 
attributes of permanent entities became a matter of some concern. It is 
interesting to note that resolving these issues led to the method of 
statistical analysis described in the remainder of Section IV. 

B. STATISTICAL CONSIDERATIONS 

In discussing recommended procedures for measuring the attributes 
of permanent entities, Conway states: 

Most importantly, some check on the amount of correlation should 
always be made. This should be done during pilot runs to determine 
how measurements are going to be made and tested again during the actual 
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experimentation. Usually the same dynamic character of the systen 
that forces one to resort to simulation in the first place ensures 
that correlation will be an important factor and problem in the 
investigation [Ref. 2] . 

With regard to the measurement of temporary entities, Conway says: 

Again the difficulty lies in the auto-correlation of the obser- 
vations and arises when one would use the sample variance to estimate 
the variance of the sample mean. To equate the variance of the mean 
with the sample variance divided by the number of observations contrib- 
uting to the mean, requires that the observations be mutually independ- 
ent and that is rarely the case. Temporary entites existing at the 
same time are subject to the same system conditions so that the values 
of the attributes tend to be positively and strongly correlated. The 
neglect of this correlation results in a substantial understatement 
of the variance of the mean [Ref. 2] . 

To reduce the amount of correlation between observations, and 
subsequently enhance precision, Conway recommends increasing the 
length of time intervals between successive measurements since, in 
general, a longer interval length will result in less correlation. 

1. Preliminary Analysis 



With all this in mind, the authors made the pilot runs refer- 
red to in the previous discussion to determine both a suitable stabi- 
lization period and an interval length which, on the one hand, would 
ensure equilibrium and yield essentially uncorrelated observations and, 
at the same time, could be acconmodated in view of the high execution 
ratio. 



The amount of correlation between adjacent observations, and 



X^ + ^, was estimated using the standard formula 



(N-l)ZX.X. n - ZX. ZX. X1 
l l+l l l+l 



/[(N-l)EX 2 - (ZX^ 2 ] [(N-1)ZX 2 +1 - (ZX i+1 ) 2 ] 



where the index, i, is summed from 1 to N-l in all cases, N being the 
number of observations. Initial results were disappointing indicating 
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in sane cases high correlation with relatively long intervals and low 
correlation with shorter intervals. There did not appear to be, how- 
ever, any discernible consistency in the results, which were so disparate 
at times that it was conjectured the perfect randan number generator, at 
long last, had been achieved. 

At this stage, possible approaches to the problem were (1) to 
ignore auto-correlation completely; (2) to arbitrarily increase the 
lengths of intervals to the point that it was felt the effects of auto- 
correlation would be negligible; or (3) to perform a full analysis of the 
generated data in order to use, rather than ignore, the inherent auto- 
correlation. 

The first approach is unsatisfactory since, especially in a 
scientific discipline, one at least should make an attempt to resolve 
pertinent issues rather than suspend them from consideration. Similarly, 
the second strategy tends to be nebulous and inconclusive, and may lead 
both to the discarding of valid data which should be included in the 
sample and to inordinately long computer runs. Thus, the third approach 
was taken, principally because it was the least unsatisfactory of the 
three from an experimental point of view. As it turned out, the same 
experimental results would have been obtained even if auto-correlation 
had been ignored completely, but this was not evident beforehand. 

2. Spectral Analysis 

If one were interested only in obtaining an estimate for the 
mean of a stochastic process the sample average would suffice, and it 
would be a simple matter to run the experiment until a specified number 
of observations had been acquired, calculate the sample average and quit. 
However, a problem arises when attempting to make probability statements 
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about this estimate of the mean, such as the calculation of a confidence 
interval. Traditional statistical methods generally apply to independ- 
ent observations only and, hence, cannot be used to analyze the auto- 
correlated time series of data generated by a computer simulation model. 

The method of analysis used herein was obtained entirely from the works 
of Fishman and Kiviat, [Refs. 3 and 4] , who point out that: 

The variance of the sample mean computed from a set of independ- 
ent observations is inversely proportional to the number of observations. 
This is not true for auto-correlated data. For sufficiently long sample 
record lengths, however, one may show that the variance of the sample 
mean for auto-correlated data is inversely proportional to a fraction 
of the number of observations. This fractional factor depends on the 
auto-correlation properties of the process. By analogy with the inde- 
pendent case, it seams natural to regard this fraction of the number of 
observations as the number of equivalent independent observations . 

To develop this analogy, we introduce the concept of the cor - 
relation time of a process. If a process is observed for a time inter- 
val equal to n correlation times, then one may show that, from the point 
of view of the variance of the sample mean, this time series is equiva- 
lent to collecting n/2 independent observations [Ref. 4] . 

It is the rule rather than the exception that a stochastic 
process will yield fluctuations about the mean, these fluctuations provid- 
ing a measure of the variability of that mean. Furthermore, the deviations 
generally have frequency components which can be described by examining 
the entire spectrum of the generating process. The method discussed 
previously of measuring the amount of correlation between adjacent obser- 
vations took into account only a portion of that spectrum and consequently 
yielded little information concerning the magnitude and period of any fre- 
quency components which may have contributed to the variance. 

In general, highly auto-correlated time series tend to be rela- 
tively stable, or sluggish, in the sense that major fluctuations about 
the mean seem to have rather long periods and the process is said to have 
a long correlation time. Conversely, it is characteristic of less auto- 
correlated time series that observations will fluctuate rapidly, and with 
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varying proportions, in what appears to be an unpredictable fashion. 
Frequency components may seen to be less discernible and the cor- 
relation time is said to be short. 

Spectral analysis, then, involves an examination of the 
observed data to determine the magnitude and period of any frequency 
components which contribute to the variance of the attribute ueing 
measured. On the basis of an estimate for the correlation time the 
number of equivalent independent observations is calculated. An approx- 
imate t-test can be applied to obtain confidence intervals. 

The generation of data in this simulation experiment can be 
thought of as a time-dependent covariance stationary stochastic process 
(X^_, t=0, 1, 2, . . . }. Definitions obtained from Fishman and Kiviat 
include, the mean 

y = E(X t ); 

the autocovariance function 

= E[(^_ - y) (X t+T - y)] t = 0, 1, 2 

the spectral density function 

1 

f (A) = i [1 + 2 t | 1 (R t /R 0 ) cos At] 0 s A < n; 
and the spectrum 

g (A) = R 0 f (A) . 

Clearly, R Q is derived frcm the second moments of the stochastic process 
(X t > and it is seen that if {X^} has fixed, but possibly unknown, variance 
a , tnen 

Rq = E[(X t - y) 2 ] = a 2 . 
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In general 



R 

T 



2 

o p 



T 



where p is the auto-correlation function which measures the influence 

T 

of past events in the system on the present observations. The function 
p^ lies in the closed interval [-1, 1] and equals unity when t equals 
zero. 

The parameter A used in both the spectral density function and 
the spectrum is a measure of angular frequency. As mentioned previously, 
the process {X^} is thought of as being composed of a number of fre- 
quency components and the contribution of angular frequencies around A 
2 

to the variance o is expressed in terms of the spectral density function, 
f (A) . 

Considering one simulation run as generating a time series {X^} 
of length T, the sample average is given by 



- 1 T 

X t = T Z X t 
r t=l 



and its variance is 



i T ' 1 

Var (X t ) = i [Eq + 2 I (1 

T=1 



xA)R r l 



From the expression 



Lim T Var (X ) = it g(0) = irR f (0) , 

T+oo ** u 

Fishman and Kiviat point out that the large sample variance of X^_ can 
be approximated by 

Var (X t ) -2R 0 t*/T, 

★ 

where t , the correlation time, is directly proportional to the amount of 
auto-correlation and is defined by 
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= tt£(0)/2 



* 

T 

* 

The significance of t can be seen by examining a special case. 

2 

Assuming that the sequence {X^ } is uncorrelated with variance o , 

. . — 2 
it is obvious that the variance of the sample mean, X^, is 0 /N, 

or Rq/N, where N is the number of independent observations. By 

equating this variance with the variance for the auto-correlated 

case, it is seen that 



R c/ N = 



= 2R, 



0 



* 

t /T, 



so that 



N = T/2t* 



which yields the number of equivalent independent observations in 

the auto-correlated sequence {X^} of length T with correlation time 
* 

t . Thus, the process of observing the auto-correlated sequence over 

* 

the time interval 2x is equivalent to obtaining one independent 
observation. 

Since there was no reason to expect that any two given simu- 
lation runs possessed the same amount of auto-correlation it was 
necessary to estimate separately for each run (1) the variance of the 
sample mean; (2) the correlation time; and (3) the number of equivalent 
independent observations. The following formulas were used to make 
these estimations: 



R 

T 



1 

T-t 



T-t 

Z 

t^l 



(X t - x t ) (X 



t+T 




t 



V = ^ [Rq + 2 E^ (1 - T/M) R x ] M < T— 1, 

t * = TV/ (2^) , 
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and 



N = T/(2t*) = I^/V. 

This estimate of the variance uses M instead of T-l lags, or off- 
sets, since the variance does not change measurably if more than a 
certain number, M, of the lags are used when the number of observa- 
tions is large. The appropriate value for M was chosen empirically 
for each run in the manner discussed below. 

Observations obtained in the simulation runs were supplied 
as input data to the analysis program, a listing of which appears 
following the appendices. This program calculated, for all pos- 
sible values of M, estimates for the variance, the correlation time, 
and the number of equivalent independent observations. In addition, 
estimates of the variance for each value of M were plotted. An 
example of output from the analysis program, including the graph, 
appears following the appendices. 

It can be seen by examining the sample output that the 
estimates of the variance increase in magnitude as M increases and 
reach a plateau of relative stability before finally dimiinishing. 

This increase in V is a reflection of poor resolution in the spectrum 
and is caused by using an inadequate number of lags. The effects of 
more frequency components are considered as the number of lags is 
increased and, as a result, estimates of the spectrum became more 
representative of the true spectrum. The subsequent decline of V 
represents a bias which is introduced by replacing the mean u with the 
sample average in the estimate of the autocovariance function R^. 

Fishman and Kiviat note that: 
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As M increases, the bias term clearly increases and causes 
a reduction in the estimate o r V. To avoid this, it is necessary 

to keep the number of lags M significantly smaller tnan the sample 
record length T. Bear in mind, however, that good resolution re- 
quires that M be sufficiently large so that the function g changes 
slowly at frequency tt/M [Ref. 4] . 

Using the output data as an example, it is seen that V 
stabilizes when M is in the vicinity of 29. At this point the 



estimate of the variance of is 3.473 and about 98 equivalent 

independent observations resulted from this simulation run. 

Approximate 90% confidence intervals for X were calculated 

for all simulation runs using the statistic 

X t - y X - v 



t = 



(2o 2 t*A) 1/2 




which was shown to have an approximate Student t distribution with 
1.5T/M equivalent degrees of freedom. 

Tests for significant differences in means between two runs 
were made by using the statistic 



, _ (X t,l - X t,2> ' ( U - "2 1 



t' = 

a 



^1 + V 2 > 



1/2 



which was shown to have a Student t distribution. The possibility 
that the equivalent degrees of freedom for the two runs, 1 . 5T^/M^ 
and 1.5T 2 / 1 ^ 2 , were unequal required the calculation of t' by 



t' = 

a 



(V ) ^ 2 t + (V ) t 

r l,g kv 2 [ 2,a 

(v 1 ) 1/2 + (v 2 ) 1/2 



where t n and t_ are the critical values obtained from a table of 
l,a 2, a 

the Student t distribution. The confidence interval is defined by the 
probability statement 



p[(x t,i - x t, 2 > - ^1 h 'T 1/2 4 "1 - “2 4 (x t,i - x t, 2 > 



+ t’ (v n + V 0 ) 1/2 ] : 1 - a 
a ± 2 
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Throughout this analysis it was assumed that the sequence 



{X t } was Gaussian distributed. The validity of this assumption was 
based on the fact that all observations were obtained by averaging 
the transit time of five consecutively terminating transactions. The 
results of test runs indicated that when ten transactions were grouped 
to form one observation the presence of auto-correlation in the 
sequence {X^} was obscured. To use this method of analysis, however, 
it was necessary to be able to observe the auto-correlation which was 
present and, thus, the number of transactions constituting one observa- 
tion was reduced to five. As a result of this reduction, a sequence 
of observations was obtained in which the presence of auto-correlation 
was readily apparent. 
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V. EXPERIMENTS AND RESULTS 



In order to fulfill the objectives of this study three experiments 
were performed. Experiment A investigated the behavior of the system 
under the additional load resulting frcm an increase in the number of 
terminals supported. Experiment B was designed to determine if system 
performance could be improved significantly either by more efficient 
utilization of the existing hardware or by the installation of addi- 
tional hardware. In Experiment C the effects of changes to the main 
scheduling algorithm in DISPATCH were explored. 

Although response time was the primary measure of system perform- 
ance, as mentioned in Section IV, other measures of performance were 
determined and are included in the tables of results in Appendix D. 

Among these measures is the Elapsed Time Multiplication Factor (ETMF) 
introduced by StimlerfRef. 14]. For the purposes of this study the 
ETMF is defined to be the average response time under time-sharing 
divided by the average response time if all jobs were run on a "stand- 
alone" basis. Thus, as the performance of the system improves , the 
EIMF decreases, approaching a lower limit of one. The ETMF provides a 
measure of time-sharing system performance which differs from response 
time in one important respect. If the average amount of CPU time and/or 
I/O time required by a group of jobs increases, their average response 
time naturally increases by at least an equal amount, even if the system 
is completely efficient. On the other hand, these same increased require- 
ments would not cause any increase in the ETMF. Thus, the ETMF provides 
a useful means of comparing system performance in experimental runs where 
the CPU and I/O rates differ. A discussion of haw the EIMF was calculated 
here is given in Appendix C. 
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A. EXPERIMENT A 



In order to measure system degradation under an increased load, 
four series of five runs each were made. Within each series the 
conditions were identical with the exception of the number of ter- 
minals which, after an initial run with 12 terminals, was raised 
from 15 to 30 in steps of five. Between series of ruins the I/O 
rate and paging interrupt rate distributions were varied. The mean 
rates used in all experiments are tabulated in Table V. 



TABLE V 

MEAN I/O AND PAGING INTERRUPT RATES 

DESCRIPTION 
LOW I/O 
HIGH I/O 
LOW PAGING 
HIGH PAGING 



MEAN RATE 

24.10/sec. 

36.36/sec. 

22.73/sec. 

38.46/sec. 



MEAN INTERVAL 

41.5 msec. 

27.5 msec. 

44.0 msec. 

26.0 msec. 



The results of Series A1-A4, Experiment A, are summarized in 
Tables VI-IX respectively. Figures 3 and 4 present graphical displays 
of the observed response times and calculated ETMFs respectively. The 
cumulative distributions for response times of Series A3 are pictured 
in Figure 5. It was observed that in each series average response 
times increased as the number of terminals increased with the exception 
of the first two runs of Series Al, where the average response time 
decreased as the number of terminals was increased from 12 to 15. The 
differences between the average response times in each series were tested 
for significance using the method described in Section IV and, except 
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for three, were found to be significant at the .10 level. The three 
differences which were not significant were (1) the decrease observed 
between runs Al.l and A1.2; (2) the increase between runs A2.4 and 
A2.5; and (3) the increase between runs A4.1 and A4.2. It also was 
noted that, within each series, the ETMF increased as the number of 
terminals increased, with a much greater relative increase in Series 
A3 and A4, thus indicating a more pronounced degradation of perform- 
ance when I/O rates were high. 

The number of pages read per second increased as the number of 
terminals increased for all four series. This effect might have been 
attributed entirely to the fact that the percentage of execution and 
thus, the number of paging interrupts occurring per second also was 
increasing. However, it was observed after detailed analysis that 
even though the number of paging interrupts occurring per second of 
execution (paging interrupt rate) remained constant within each series, 
the number of pages read per second of execution increased considerably 
within each series. In other words, the percentage of paging inter- 
rupts which resulted in a page being read was increasing. Therefore, 
it was concluded that the expected increase in paging activity definitely 
was present and undoubtedly contributed to the increase in response time 
and ETMF. 

Although all preliminary runs had indicated that the model was stable. 
Series A5 was run to verify the stability further. Each of the three runs 
was a duplicate of a previous run but with different seeds for the eight 
randan number generators. Run A5.1 was a repeat of run Al.l; run A5.2 
was a repeat of run A2.1; and run A5.3 was a repeat of run A2.5. The 
results of these runs are tabulated in Table X, Appendix D. As was 
expected, there were no marked differences between these runs and their 
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corresponding previous runs, and their average response times fell 
well within the 90% confidence intervals established by the previous 
runs. (See Figure 6.) Comparison of the corresponding average 
response times showed no significant differences at the .10 level. 

It also can be seen frcm Figure 7 that there were no appreciable 
differences in the corresponding ETMFs. 

B. EXPERIMENT B 

The purpose of Experiment B was to test whether or not an improve - 
ment in response time would result frcm (1) more efficient utilization 
of existing DASDs; or (2) addition of more DASDs. The first hypothesis 
was tested in Series B1 and the second in Series B2. 

As noted earlier, the paging drum can store up to 15 virtual machines. 
Since a maximum of only 12 virtual machines is required with 12 ter- 
minals there exists room for approximately 200 additional pages on the 
drum. It was felt that response time might be improved if this space 
could be utilized to store the most frequently referenced pages frcm 
the CMS disk. It was determined that 40% of the disk resident CMS pages 
could be moved to the drum. Furthermore, it was assumed that these 200 
pages could be chosen in such a manner as to account for at least 80% of 
the requests for CMS pages. The minor alterations to the model which 
were necessary to reflect this change to the system were incorporated in 
runs Bl.l through B1.4. 

Another, perhaps more obvious, area for improvement was in the assign- 
ment of 2311 disk drives. With 15 or fewer terminals the paging disk was 
not used and, in addition, because the CMS disk was used to store only 
the 500 system pages, approximately 75% of its capacity was unused. The 
DISK function in the model was changed to make this unused space avail- 
able for I/O operations in the runs of Series Bl. 
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The results of Series Bl, tabulated in Table XI, Appendix D, 
were compared with the corresponding runs in Experiment A. The 
comparison of average response times is presented graphically in 
Figure 8. It was found that only between runs A4.1 and B1.4 was 
there a significant reduction in average response time. Further- 
more, although the EIMFs for the runs of Series Bl were, in every 
case, lower than those of their corresponding runs in Experiment A 
the differences were slight, as depicted in Figure 9. 

In Series B2 the effect of adding four additional I/O disks to 
the system was investigated. This alteration was particularly 
interesting since, at the time the runs were made, this modification 
actually was being planned. The DISK function in the model was 
changed to distribute the I/O requests uniformly over six disks, and 
four runs were made. These runs, tabulated in Table XII, Appendix 
D, were identical to their corresponding runs in Experiment A with 
the exception of the increased number of I/O disks. 

The results of these runs were encouraging. As can be seen in 
Figure 10, average response times decreased significantly for runs 
B2.2, B2.3, and B2.4. Although the response time for run B2.1 in- 
creased slightly, the increase was not significant. Furthermore, as 
the bar graph in Figure 11 shows, the EIMF was decreased in every 
case. It also was observed that the percentage of execution increased 
noticeably in runs B2.3 and B2.4 

C. EXPERIMENT C 

Experiment C was designed to test the effect on response time of 
a new scheduling algorithm. The algorithm used was a modified version 
of one which is incorporated in newer versions of CP/CMS than the one 
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modeled. The algorithm is designed to reduce overhead and paging load 
by limiting the number of jobs which simultaneously can compete for 
system resources. At any given time there are only two groups of jobs 
which can compete for resources. Group 1 and Group 2. The maximum 
number of jobs allowed in Group 1 at any time is N^, and the maximum 
for Group 2 is N 2 . Thus, a maximum of + N 2 jobs can vie for system 
resources. Any remaining jobs are in queues waiting to join either 
Group 1 or Group 2. 

When a user completes a terminal interaction and the number of jobs 
in Group 1 is less than the user's job is assigned a time slice of 
400 milliseconds and is placed in Group 1. However, if the number of 
jobs in Group 1 is equal to the job enters a FIFO queue of jobs 

waiting to join Group 1. When a job in Group 1 finishes its 400 milli- 
second time slice and the number of jobs in Group 2 is less than N 2 
the job is assigned a time slice of five seconds and is placed in Group 
2. However, if the number of jobs in Group 2 is equal to N 2 the job 
enters a FIFO queue waiting to join Group 2. When a jcb completes its 
five second time slice it is placed at the end of the queue for Group 2. 

If possible the scheduling algorithm selects a runnable job for 
execution frcm Group 1. If there are no runnable users in Group 1 a 
job in Group 2 may be selected. If neither group contains runnable jobs, 
no job is started, even though there may be runnable jobs waiting in the 
queues to join either group. 

After the appropriate changes were made in the model to reflect the 
new scheduling algorithm, two series of runs were made. The values as- 
signed to and N 2 were varied among the runs to measure their effect, 
if any, on system performance. In both series the I/O rate and paging 
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interrupt rate were both set to high. The run conditions and results 
for Series Cl and C2 are summarized in Tables XIII and XIV respectively, 
in Appendix D. 

The average response times of Series Cl and C2 runs are depicted 
graphically in Figure 12, along with the average response times of 
previous experiments with the same conditions. The analysis of results 
revealed that at a .10 level of significance there were no differences 
between the runs of Experiment C and their corresponding previous runs. 
Furthermore, it was found that the average response times from those 
runs in Experiment C which differed only in the values assigned to 
and ^ were not significantly different. 
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VI. CONCLUSIONS 



As mentioned earlier in this paper, it was not possible to 
validate the model used in the analysis of CP/CMS and, therefore, 
any conclusions based on the results of experiments performed with 
that model are subject to question. Nevertheless, it is felt that 
even though the model was not validated it is definitely similar to 
the system in most respects and, thus, should reflect the system's 
behavior to seme extent. 

The results obtained from Experiment A indicated that, for all 
job mixes considered, there was a substantial jump in the ETMF 
between 15 and 20 terminals, and for greater than 20 terminals both 
ETMF and response time increased rapidly. It also was observed that 
the increase in ETMFs between 12 and 15 terminals was relatively small 
in most cases and, furthermore, it was noted that in two of the four 
series of runs there was not a significant increase in response time 
between 12 and 15 terminals. It was concluded, therefore, that 15 
terminals could be supported without serious degradation of response 
time. 

The extremely high I/O disk utilization figures observed for 20 and 
more terminals supported the hypothesis that the sharp increase in ETMFs 
and response times was caused principally by the increased amount of jobs 
which were required to wait in the queue for I/O disks. This hypothesis 
was supported when it was calculated that in runs A3. 3 and A4.3 each job 
spent an average of 8.6 and 7.3 seconds, respectively, in the queues 
for I/O disks. 
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Although the more efficient use of DASDs in Series Bl caused a 
significant reduction in response time in only one of the five runs, 
the ETMFs for the three runs with high I/O rates were reduced by a 
minimum of 13%. It appears, therefore, that although this experiment 
did not show a general significant reduction in response times, there 
is still a strong possibility that system performance could be enhanced 
by the proposed changes. 

More definite results were (obtained when Series B2 runs were made 
with six I/O disks. Three of the four runs showed statistically signif- 
icant improvement in response times and all four of the runs exhibited 
a marked decline in ETMFs. The reduction in average I/O disk utiliza- 
tion observed in the 20 terminal runs suggested that the additional I/O 
disks appreciably reduced high I/O disk queueing times which were observed 
in runs A3. 3 and A4.3. These results indicate that the system could 
adequately support 20 terminals with the additional four I/O disks. 

The scheduling algorithm tested in Experiment C appeared to have 
little or no effect on either response time or ETMF when and N^, the 
Group 1 and Group 2 sizes, were both equal to six. However, when the 
Group 2 size was changed to three for run C2.1 and to nine for run C2.2 
a noticeable decrease occurred in both response times and ETMFs. Even 
though the decrease in response time was not statistically significant, 
the fact that there was an appreciable decrease in these runs, where none 
was observed in runs Cl.l and Cl. 3, suggested the possibility that the 
efficiency of the scheduling algorithm is extremely sensitive to the values 
selected for and N^. 
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In summary, the results of this study indicated that the perform- 
ance of the Naval Postgraduate School time-sharing computer system is 
limited by its small disk I/O capability, and an increased number of 
disk drives are required for the system to support a large increase 
in the number of attached terminals. 



84 



APPENDIX A 



DATA FOR 
CHI-SQUARE TEST 
FOR 

GOODNESS OF FIT 



FUNCTION: NRPAG N = 380 



INTERVAL 


PROBABILITY 

( V 


EXPECTED 

< Ej) 


OBSERVED 

( 0j) 


0.5- 1.5 


.20 


76 


77 


1.5- 2.5 


.30 


114 


116 


2.5- 3.5 


.06 


22.8 


28 


3.5- 4.5 


.05 


I 

19 


19 


4.5- 5.5 


.04 


15.2 


12 


5.5- 6.5 


.04 


15.2 


16 


6.5- 7.5 


.04 


15.2 


19 


7.5- 8.5 


.03 


11.4 


12 


8.5- 9.5 


.02 


7.6 


10 


9.5-10.5 


.02 


7.6 


8 


10.5-16.5 


.06 


22.8 


18 


16.5-25.5 


.04 


15.2 


10 


25.5-57.5 


.10 


38 


35 
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FUNCTION: REACT 






N = 374 


INTERVAL 


PROBABILITY 


EXPECTED 

(E.) 

3 


OBSERVED 

(0.) 

3 


0- 2000 


.046 


17.20 


23 


2000- 4000 


.084 


31.42 


45 


4000- 6000 


.105 


39.27 


38 


6000- 8000 


.126 


47.12 


48 


8000- 10000 


.096 


35.90 


25 


10000- 12000 


.082 


30.67 


25 


12000- 14000 


.049 


18.33 


15 


14000- 16000 


.041 


15.33 


18 


16000- 18000 


.036 


13.46 


16 


18000- 20000 


.031 


11.59 


18 


20000- 22000 


.025 


9.35 


13 


22000- 24000 


.023 


8.60 


11 


24000- 26000 


.020 


7.48 


4 


26000- 28000 


.018 


6.73 


4 


28000- 30000 


.017 


6.36 


5 


30000- 35000 


.038 


14.21 


15 


35000- 60000 


.030 


11.22 


9 


60000-100000 


.033 


12.34 


14 


100000-200000 


.050 


18.70 


17 


200000-300000 


.100 


18.70 


11 
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FUNCTION: EXPON 






N = 412 


INTERVAL 


PROBABILITY 


EXPECTED 


OBSERVED 


(p j> 


< Ej) 


(0 j) 


0 


.005 


2.06 


0 


0- 100 


.043 


17.72 


21 


100- 200 


.047 


19.36 


15 


200- 300 


.044 


18.13 


25 


300- 400 


.042 


17.30 


15 


400- 500 


.040 


16.48 


18 


500- 600 


.038 


15.66 


20 


600- 700 


.037 


15.24 


15 


700- 800 


.033 


13.60 


16 


800- 900 


.033 


13.60 


19 


900- 1000 


.032 


13.18 


14 


1000- 2000 


.236 


97.23 


101 


2000- 3000 


.148 


60.98 


60 


3000- 4000 


.086 


35.43 


29 


4000- 5000 


.054 


22.25 


19 


5000- 6000 


.032 


13.18 


11 


6000- 7000 


.020 


8.24 


8 


7000- 8000 


.011 


4.53 


3 


8000- 9000 


.008 


3.30 


1 


9000-10000 


.004 


1.65 


2 


10000-16000 


.007 


2.88 


0 
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FUNCTION: SEEK 


N = 8511 


INTERVAL 


PROBABILITY 

‘V 


EXPECTED 

«D> 


OBSERVED 

(0.) 


25- 65 


.19 


988.19 


1016 


65- 75 


.45 


2340.45 


2296 


75-108 


.28 


1456.28 


1495 


108-135 


.08 


416.08 


394 



The formula applied to arrive at the computed value in all cases 



was 



2 <0 l-¥ 2 . (0 2- E 2 )2 

X = — ? + 5 



+ * * • + 



(0 » - ^ 



or 



2 ? W 

X = 2 — J J - 



j=l 



E d 



where 



E. = NP. 
3 3 



Results of these computations are tabulated in Table XV, Section III. 
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APPENDIX B 



CALCULATION 

OF 

CONFIDENCE INTERVALS 



The following calculation uses data obtained from the sample out- 
put, which corresponds to run A3. 4. 

The sample average is given by 
X^_ = 19.195 seconds, 

and the estimate for the variance using 29 lags, is seen to be 
V = 3.473. 

The equivalent degrees of freedom are obtained frcm 
EDF = 1.5(200/29) 

= 10.3. 

Frcm a table of the Student t distribution, we have 
t 95 = 1.81, 

so that the confidence interval given by the probability statement 

.1/2 _ a/2 

P(X t - t_ 95 V < u < X t + t_ 95 V ) = .90, 

is seen to be 



19.195 + 3.374 or (15.821, 22.569). 



Using runs A4.1 and B2.2 as an example of the test for difference in 
means, the following data are observed: 

X ^ = 8.493 seconds 



X^ = 5.487 seconds 
V, = 0.52362 
V 2 = 0.16996 



a/2 

V = 0.72360 

a/2 

V 2 = 0.41223 
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Fran a table of the Student t distribution we have 



so that 



= 2.02 
t 2 = 1.68 

. 1/2 

V ± = 1.462 

a/2 

t 2 V 2 = 0.688 



and the statistic t' is calculated as 

t' = (1.462 + 0.688)/(0. 72360 + 0.41223) 
= 2.150/1.136 
= 1.893. 



Further, 



and 



1/2 1/2 

(V 1 + V 2 ) = (°- 69358) ' 

= 0.8328, 



X 1 - X 2 = 8.493 - 5.487 



= 3.006 seconds, 

which is well outside the interval of acceptance given by 



P(-l. 893(0. 8328) < ^ <. + 1.893(0.8328)) = .90. 

Thus, the hypothesis is rejected at the .10 level of significance that 
the means are equal. 
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APPENDIX C 



CALCULATION 
OF THE 

ELAPSED TIME MULTIPLICATION FACTOR 



The Elapsed Time Multiplication Factor, or ETMF, is defined by 
ETMF = R/T, 

where R is the average response time and T is the average time to 
complete a task when it is run alone. When computer programs are 
run as "stand alone" tasks T is given by 

T = E + I, 

where E is the average CPU time required and I is the average I/O 
time required. If IR is the average rate of I/O requests per second 
of execution and S is the average I/O service time, then 

I = E (IR) S. 

Substituting in the above equations yields 

T = E(1 + (IR) S) , 



and 



ETMF = R/(E(1 + (IR) S) ) . 

The values of R and E were obtained from each experimental run. 

IR, the average I/O request rate for a given run, was approximated by 
the mean of the appropriate I/O request rate distribution. Similarly, 

S, the average I/O service time, was approximated by the sum of the 
means of the seek time distribution (given by the function SEEK) and 
the rotational and transmission delay distribution (given by the variable 
DIOT) . Preliminary runs indicated that these approximations were very 
good. 
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As an example, for run Bl.2 the following values were obtained frctn 
Tables V and IX: 

R = 4.389 seconds 
E = 872 msec. = 0.872 seconds 
IR = 24.10/second. 



Furthermore , 



Therefore , 



S = mean of SEEK + mean of DIOT 
= 0.075 + 0.038 
= 0.113 seconds. 



ETMF = 



4.389 



0.872(1 + (24.10) (0.113)) 



= 1.35. 
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TABULATED EXPERIMENTAL RESULTS 
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TABLE VI (continued) 
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RESULTS OF SERIES A2 RUNS 
(LOW I/O, HIGH PAGING) 
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TABLE VII (continued) 
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RESULTS OF SERIES A3 RUNS 
(HIGH I/O, LOW PAGING) 
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7.915) 10.441) 17.113) 22.569) 36.947) 



TABLE VIII (continued) 
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RESULTS OF SERIES A4 RUNS 
(HIGH I/O, HIGH PAGING) 
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FOR AVERAGE RESPONSE (7.031, (7.165, (10.776, (20.336, (28.583, 

TIME 9.955) 11.805) 15.272) 25.984) 38.199) 



TABLE IX (continued) 
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RESULTS OF SERIES A5 RUNS 
(CHECK OF STABILITY) 
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(SEC.) 4.903 4.363 14.888 



TABLE X (continued) 
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RESULTS OF SERIES Bl RUNS 
(CMS PAGES ON DRUM & 3.75 I/O DISKS) 
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RESPONSE TIME (SEC.) 4.751 4.389 5.824 5.931 8.577 



TABLE XI (continued) 
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RESULTS OF SERIES B2 RUNS 
(SIX I/O DISKS) 
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AVERAGE RESPONSE TIME (SEC.) 6.748 5.487 7.986 8.203 



TABLE XII (continued) 
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RESULTS OF SERIES Cl RUNS 
(NEW SCHEDULING ALGORITHM) 
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AVERAGE RESPONSE TIME (SEC.) 8.073 13.892 34.354 



TABLE XIII (continued) 
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RESULTS OF SERIES C2 RUNS 
(NEW SCHEDULING ALGORITHM) 
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AVERAGE RESPONSE TIME (SEC.) 7.296 30.028 8.369 



TABLE XIV (continued) 
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APPENDIX E 



FLOWCHARTS OF THE MODEL 
INITIALIZATION, TIMING 8 OUTPUT 
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PAGING 

A. CORE STATUS TABLE LOOK-UP 
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13 ABSTRACT 



A GPSS model of the CP/CMS time-sharing computer system at the Naval Postgraduate 
School was constructed, and was used in three experiments to investigate the perform- 
ance of the system under a variety of conditions. In each of the experiments the 
model generated auto-correlated sequences of observations which were analyzed using 
techniques adapted from spectral analysis. 

An experiment to determine the effect of an increased number of terminals on 
average response time revealed that the system adequately could support a 25% increase 
in the number of terminals, and that the number of terminals was limited by the 
magnetic disk I/O capability. In a second experiment it was found that increasing 
the number of disks from four to eight would enable the system to support up to 20 
terminals. A final experiment involving the examination of a new scheduling algorithm 
shewed no significant charges in average response times. 
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